A Hierarchical Model for Universal Schema Relation Extraction
نویسندگان
چکیده
Relation extraction by universal schema avoids mapping to a brittle, incomplete traditional schema by instead making predictions in the union of all input schemas, including textual patterns. Modeling these predictions by matrix competition with matrix factorization has yielded state-of-the-art accuracies. One difficulty with prior work in matrix factorization, however, is that there is no negative training data. As a result, existing methods merely sample an entity-pair’s unobserved relation types and assume they are negative. In this paper we instead maximize the likelihood of the observed data—achieving tractability by arranging the relation types as leaves in a binary tree. We show empirically that the choice of tree structure is consequential, and achieve a 2.61% F1 score improvement over the previous approach. Furthermore, a simple combination of this approach with the previous approach results in a 3.53% gain in F1 score.
منابع مشابه
Multilingual Relation Extraction using Compositional Universal Schema
Universal schema builds a knowledge base (KB) of entities and relations by jointly embedding all relation types from input KBs as well as textual patterns observed in raw text. In most previous applications of universal schema, each textual pattern is represented as a single embedding, preventing generalization to unseen patterns. Recent work employs a neural network to capture patterns’ compos...
متن کاملRow-less Universal Schema
Universal schema jointly embeds knowledge bases and textual patterns to reason about entities and relations for automatic knowledge base construction and information extraction. In the past, entity pairs and relations were represented as learned vectors with compatibility determined by a scoring function, limiting generalization to unseen text patterns and entities. Recently, ‘column-less’ vers...
متن کاملRelation Extraction with Matrix Factorization and Universal Schemas
Traditional relation extraction predicts relations within some fixed and finite target schema. Machine learning approaches to this task require either manual annotation or, in the case of distant supervision, existing structured sources of the same schema. The need for existing datasets can be avoided by using a universal schema: the union of all involved schemas (surface form predicates as in ...
متن کاملTowards Combined Matrix and Tensor Factorization for Universal Schema Relation Extraction
Matrix factorization of knowledge bases in universal schema has facilitated accurate distantlysupervised relation extraction. This factorization encodes dependencies between textual patterns and structured relations using lowdimensional vectors defined for each entity pair; although these factors are effective at combining evidence for an entity pair, they are inaccurate on rare pairs, or for r...
متن کاملLatent Relation Representations for Universal Schemas
Supervised relation extraction uses a pre-defined schema of relation types (such as born-in or employed-by). This approach requires labeling textual relations, a time-consuming and difficult process. This has led to significant interest in distantly-supervised learning. Here one aligns existing database records with the sentences in which these records have been “rendered”, and from this labeli...
متن کامل